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The goal of our proposal “A Complete Public Archive for the Einstein IPC” was to 
provide for the community a flexible archive and analysis system. The need for such a 
system has been apparent for some time, and the progress made at Columbia over the past 
ten years in understanding and characterizing fully this ground-breaking X-ray astronomy 
instrument, coupled with the enormous advances in computing power since the original 
mission, make such a system both desirable and practical. The continued interest and 
scientific value of the IPC data is apparent from the publication statistics compiled by Dr. 
F.D. Seward at the Smithsonian Astrophysical Observatory: in 1990 alone, 13 years after 
the launch (and 9 years after the demise) of the mission, 91 papers using Einstein data 
appeared in the literature. With the advent of several new X-ray missions in the 1990s 
(two in progress and four more to come), the value of these archival data will only increase. 
Placing the powerful Columbia analysis system in the public domain under the aegis of 
the HE AS ARC offers permanent access to this important dataset. 

Our primary goal is now accomplished. In early September, after extensive interaction 
with HE AS ARC personel, we completed the transfer of the full IPC archive along with a 
documented set of software which provides for the user a suite of flexible tools with which 
to derive fluxes and upper limits for any celestial position covered by the 4082 pointings 
conducted during the mission. The tools include the flat-fielding algorithm developed at 
Columbia to correct IPC images for the substantial 1-30 arcminute-scale features in the 
response of the instrument which seriously compromised some early analysis efforts. A 
hypercube of flat-fields derived from the full, source-subtracted dataset is invoked when an 
image is to be analyzed, and a customized flat-field corresponding precisely to the editing 
criteria used on the image is applied (an option for the user to employ the global flat field 
using all energy channels and apsect criteria is offered for cases in which the data-editing 
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flags applied to the image produce a corresponding flat field with insufficient statistical 
precision). Since the user may select from all possible energy ranges, aspect qualities, and 
viewing geometries in any combination, complete flexibility is achieved. 

The source search algorithm used for analyzing a selected, fiat-fielded image has un- 
dergone the most improvements over the past two years. The version supplied with the 
system can be utilized to search a single field, or can perform a source search to a user- 
specified threshold over the entire dataset. A complete description of the algorithm is 
provided in our paper ’’The Einstein Two-Sigma Catalog” (Moran et al. 1996) which has 
recently been accepted for publication in the Astrophysical Journal, and will appear next 
April (preprint enclosed). 

In addition to these fundamental improvements, a number of I/O and other enhance- 
ments were included in the final version of the program (dubbed OpEd at Columbia for 
historical, uninteresting reasons). These include: 

1. Along with the traditional ascii output maps, the three basic OpEd tasks now produce 
FITS output: 

SCAN - produces 3 FITS images of the raw, exposure and flux maps 

EXAM - produces individual FITS PHA files of each source found by SCAN 

TIME - produces a FITS table of the light curve output 

Some effort has been put into ensuring that all the FITS output is consistent with 
OGIP/GSFC conventions, the FITS format defined in Astronomy and Astrophysics 
Supplement Series v44/p363, v44/p371, v73/p359, v73/p365, and the NASA Science 
Office of Standards and Technology for the FITS Definition, document # 100. 
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2. Modifications to the majority of the routines and subroutines from the original ver- 
sion of OpEd which has been running on the SUNs at Columbia since 1993 include a 
number of minor upgrades and bug fixes. In general, the changes are transparent to 
the casual user; all have been documented in the source code. In particular, informa- 
tion concerning machine-dependent changes to the source code axe discussed in the 
introductory comments to each OpEd program. 

3. SCAN now runs entirely on 32" binned pixels instead of 64" pixels. The FITS images 
are 240 x 240 32" pixels on the side (the digit_map ascii output is, however, coadded 
to 120 x 120 pixels). 

4. The all-sky SCAN scripts are now installed as a new task option within OpEd. 

5. OpEd continues to work by reading the highly compressed IPC binary files for fastest 
I/O. These files are identical to those in the IPC database at Columbia. However, for 
these SUN/IEEE files to be read correctly on the HEASARC machine LEGACY they 
must be byte-swapped. To do this, programs were devloped which identify every type 
of float and integer in the headers and the data records of the ASP, TGR and XPR 
files and treat them correctly. On LEGACY, once a selection has been made with 
the OpEd task SELECT and an ipclist is generated, the SUN files are read by the 
appropriate byte-swapping routines and rewritten into the user’s working directory. 
All other OpEd routines then work on these files. Since files will accumulate in this 
directory, the old OpEd task REMOVE has been reinstated. In keeping with the 
OpEd philosophy of very fast I/O, all this overhead should eventually be bypassed by 
running these byte swapping routines on the entire database to convert the files in 
one shot. If this is done by the HEASARC, the only modification needed to OpEd is 
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to delete calls to the byte-swap routines in the SELECT script. 

In summary, the primary task of this program has been successfully completed: The 
IPC archive is installed as a public resource with a documented, flexible software system 
available for archive access and analysis. 

In addition to this task, we proposed to place in the public domain a number of specific 
data products of potential use to the community. The status of this work is as follows: 

• The Two-Sigma Catalog: As the enclosed preprint demonstrates, this task is essen- 

tially complete. A comprehensive paper describing our final catalog of all isolated 
sources down to 2-sigma significance has been accepted for publication. A total of 
~ 50,000 entries are included. A careful analysis of the catalog statistics indicates 
that over 13,000 are real celestial X-ray emitters, an increase of 300% over the offi- 
cial Einstein catalog. Techniques for generating from this catalog samples containing 
~ 90% real X-ray sources are presented. The full catalog will be available on line 
through both the HEASARC and EINLINE facilities. 

• Hard and Soft Source Catalogs: These were generated as part of the two-sigma cat- 
alog effort, and will be made available as adjuncts to the main catalog. 

• Extended Source Catalog: This effort is also complete, and a paper describing our 
detection of several hundred extended sources on scales of 2 — 10' is in preparation. 
Statistical analysis of the catalog and optical followup observations which demonstrate 
its utility in the study of galaxy clusters are included in this publication. 

• Burst Catalog: The entire database was searched for short-timescale (~ 1 — 100 s) 
bursts and approximately 40 such events were found which could not be attributed to 
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any counter malfunction or environmental influence. The catalog is included in a paper 
in press in the Astrophysical Journal (Gotthelf, Hamilton and Helfand 1995). The 
most interesting scientific result of this work is the limits it places on the possibility 
for a halo or gin of gamma- ray bursts. 

In conclusion, our work to create and disseminate a complete public archive of raw 
data, analysis tools, and derived catalogs is virtually complete. With the final publication 
of these results in the coming year, and the completion of catalog installation at the 
HEASARC and EINLINE facilities, the project will be terminated on September 23, 1996 
as currently provided for in the contract to Columbia University. 
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ABSTRACT 

To facilitate the study of X-ray sources fainter than those contained in the Einstein 
Medium Sensitivity Survey (EMSS), we have constructed a new catalog of sources and 
fluctuations exceeding 2a significance in 2520 high-latitude Einstein IPC images. We have 
employed various tests to validate our source-search algorithm for both high- and low- 

significance sources, and to identify and remove the small number of spurious sources 

s 

induced by our detection procedure. Based on the known vignetting and background 
characteristics of the IPC and the measured X-ray log JV-log S relation, we have modelled 
the number of real sources expected in the catalog in order to evaluate its statistical 
properties below 4<r significance. Our modelling suggests that ~ 13,000 sources in the 
catalog are real celestial X-ray sources, an increase of ~ 9,100 over the number found in 
previous analyses of the same IPC images. We find that not only is the reliability of the 
Two-Sigma Catalog a function of source significance cr, it is a function of off-axis angle 
on the detector as well. The application of different signal-to-noise thresholds at different 
off-axis angles thus enables one to time the reliability of the catalog. 

The chief motivation for studying large numbers of faint X-ray sources is to search for 
possible new components of the cosmic X-ray background. To select out real celestial X-ray 
sources in the Two-Sigma Catalog, we apply astronomical catalogs at other wavelengths 
as filters. For example, the cross-correlation of the Two-Sigma Catalog with catalogs from 
surveys of the radio and infrared sky has yielded large samples of faint X-ray sources that 
are ~ 90% reliable. Optical spectroscopy of 77 unidentified faint X-ray sources has turned 
up several surprises, illustrating the merits of selecting X-ray sources using a variety of 
methods: high-redshift quasars (one at 2 = 4.30), which are absent in the EMSS, X-ray- 
luminous (Lx ~ 10 43 ergs s -1 ) radio- loud elliptical galaxies with optical spectra devoid of 
emission lines, and infrared-bright AGNs whose optical spectra are dominated by starburst 
galaxy features. Followup observations are scheduled to determine if any of these types of 
objects represents a previously unrecognized component of the X-ray background. 
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1. Introduction 

Progress in X-ray astronomy has been regulated historically by the number of ce- 
lestial high-energy photons available for study. Advancements in mirror and detector 
technologies, and the resulting gains in the sensitivity, angular resolution, and spectral 
resolution of successive generations of X-ray observatories, have led to steady improve- 
ments in our understanding of the X-ray sky. For example, the use of focussing optics and 
position-sensitive detectors on the Einstein Observatory resulted in the detection of several 
thousand discrete X-ray sources, a substantial increase over the number found with all the 
nonimaging instruments that preceded it. Einstein sources were detected with arcminute 
positional accuracy, making, for the first time, identification of the optical counterparts 
of X-ray sources an efficient process. As a consequence, the study of Einstein sources has 
largely shaped our current knowledge of discrete X-ray source populations. 

The Imaging Proportional Counter (IPC) was the primary instrument used on the 
Einstein Observatory. In the years before the launch of ROSAT, the ~ 4000 deg 2 of IPC 
images represented a unique resource for X-ray astronomers, providing them with ample 
incentive to characterize fully the IPC, reexamine important early IPC results, and exploit 
further the IPC data. Better understanding and calibration of the IPC permitted the 
construction of the Einstein Extended Medium Sensitivity Survey (EMSS; Gioia et a 1. 
1990; Stocke et a 1. 1991), which contains nearly eight times as many sources as the original 
large-area survey (MSS; Maccacaro et a 1. 1982; Gioia et a 1. 1984) and remains the largest 
complete, unbiased, fully identified sample of X-ray sources ever assembled. 

The study of the 1-3.5 keV X-ray background (XRB) and discrete source contributions 
to it have perhaps benefited most from more careful treatments of the IPC data. Several 
important papers have appeared in the literature in past years, fueling the debate over 
the origin of the XRB. For instance, the luminosity function for active galactic nuclei 
(AGNs) in this band derived from the EMSS by Maccacaro et a 1. (1991) demonstrated 
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that AGNs (of the types represented in the EMSS) contribute no more than ~ 40% of the 
XRB flux. Arcminute-scale fluctuation analyses of deep IPC images (Hamilton & Helfand 
1987; Barcons & Fabian 1990) showed that the unresolved component of the XRB must 
come from a population of sources which has a surface density of > 1000 deg -2 , further 
constraining the role of quasars. The development of a flat-fielding technique for the IPC 
and the determination of the particle contribution to the IPC background allowed Wu et 
a i. (1991) to derive the IPC spectrum of the XRB. Using these techniques and employing 
a new source-search algorithm, Hamilton, Helfand, & Wu (1991; cf. Primini et al. 1991) 
reexamined the Einstein Deep Survey fields of Giacconi et al. (1979) and Griffiths et al. 
(1983), obtaining more accurate measurements of the extragalactic log N(> S )- log S 
relation and the fraction of the XRB in these fields attributable to resolved sources. 

With improved characterization of the IPC has come a steady decrease in the source 
detection signal-to-noise threshold used by various authors; 5a was used in the original 
MSS and Deep Survey studies, 4 a in the EMSS, and 3.5a in the Deep Survey reanalysis of 
Hamilton et al. (1991). In principle, this threshold can be driven down further, depending 
on the intended application of the results and one’s ability to determine their statistical 
reliability. In this paper we describe the generation and properties of a catalog of sources 
and fluctuations in IPC images exceeding 2a. Our aim is to cover more sky at fainter flux 
levels than the EMSS, to complement that survey. We demonstrate several ways in which 
such a catalog can be used to yield new information about various classes of X-ray sources. 
In §2, we outline our selection criteria for the IPC data examined and the source detection 
algorithm employed in the construction of the catalog. The statistical properties of the 
catalog and subsequent adjustments designed to improve its reliability are discussed in §3. 
Applications of the catalog are described in §4. Optical spectroscopy for 77 of the new 
X-ray source counterparts reveals several fascinating discoveries including the detection of 
high-redshift (z ss 4) quasars which are absent in the EMSS sample. 
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2. The IPC Two-Sigma Catalog 

In order to interpret properly the contents of an astronomical catalog, the data se- 
lection criteria and source detection algorithm used to construct it must be understood in 
detail. This is particularly true for a catalog which purports to contain sources at the 2<r 
level. Therefore, in this section we describe how IPC data were selected for analysis and 
how the selected images were searched for sources. We also present the results of two tests 
intended to validate our procedures. 

2.1. Manufacture of the Two-Sigma Catalog 
2.1.1. Data Selection 

There are nearly 4100 IPC images covering ~ 1 deg 2 each in the Einstein database. 
Exposure times for these fields range from approximately 100 to 56,000 seconds and point- 
ings are located all over the sky. At 2.0<r, the contents of our catalog will be dominated 
by spurious detections. We specifically want to exclude, therefore, any data in which the 
spurious detection rate is likely to be higher than that dictated by statistics alone. Since 
the X-ray images used in the EMSS were chosen carefully to include only the cleanest data, 
several of the sound guidelines for selection of the data used in that survey were adopted 
here (see Gioia et a 1. 1990). However, because we do not have the same ambitions for 
completeness as the EMSS, we have included more (and slightly different) data. 

Sources of bright, diffuse emission are the chief breeding ground for large numbers 
of spurious 2cr sources independent of the detection algorithm employed. Therefore, we 
excluded all IPC images centered within 10° of the Galactic plane (eliminating most Galac- 
tic supernova remnants) and fields within the boundaries of the Magellanic Clouds (which 
have regions of intense diffuse emission; Wang et a 1. 1991). This biases the mix of real 
celestial sources in the catalog towards extragalactic objects, but allows us to minimize the 
confusion caused by Galactic stars when we set out to identify their optical counterparts 
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(§4). Next, we visually inspected the 2981 images remaining after these exclusions for obvi- 
ous sources of diffuse emission. On the basis of this inspection, we excluded 187 additional 
fields containing bright Abell galaxy clusters, regions of diffuse emission in the Virgo clus- 
ter, high-latitude supernova remnants, and regions crowded with Galactic sources, such as 
the Orion and Pleiades fields. As discussed in §3.1 below, even slightly extended sources 
can lead to the detection of several spurious sources. Therefore, we also excluded any im- 
age in which a target Abell cluster appears more extended than ~ 10'. In addition, several 
IPC targets were observed multiple times, usually for variability studies. If one image in a 
series of redundant images had substantially more exposure time (at least ~ twice as much 
as any others), we excluded the shorter exposure fields. If, on the other hand, shorter 
exposure fields had different roll angles and sampled somewhat different parts of the sky, 
or if no image in a series offered a clear advantage in exposure time, redundant images 
were kept. Finally, images with less than 300 s of good exposure time were omitted. 

The total number of IPC images used to construct our catalog is 2520, as compared to 
1435 included in the EMSS. We exclude in each image four 4' strips centered on the window 
support ribs, but we have not excluded data near the center of the field; thus, our catalog 
includes the targets of the IPC observations as well as serendipitous sources elsewhere in 
the field of view. The area of sky examined in each field is 0.75 deg 2 , compared to 0.6 
deg 2 in the EMSS. Excluding redundant fields, our catalog covers 1850 deg 2 , a factor of 
2.3 more than the EMSS. The mean and median exposure times for our sample of images 
are 3900 s and 2200 s, respectively. The distribution of image exposure times for the data 
used to construct our catalog is shown in Figure 1. 

2.1.2. Source-search Algorithm 

Having selected only data which will not enhance the number of spurious sources 
in our catalog, we also want to be certain that the detection algorithm we use does not 
systematically generate spurious sources. Our source-search algorithm is based on the one 
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developed by Hamilton et a 1. (1991) for their reanalysis of the IPC Deep Survey fields, 
and several of our steps are identical to their’s. However, when we used their algorithm 
explicitly to construct a preliminary version of the Two-Sigma Catalog, systematics were 
found to be present in the catalog below ~ 3.5cr (discussed further in §2.2.2). Thus, we were 
forced to interrogate each step in the source detection procedure to reduce the spurious 
source fraction and to improve the accuracy of our detections of real sources. There are 
no substantive philosophical differences between our new procedure and that described 
by Hamilton et a 1. (1991); our modifications have merely optimized the algorithm for the 
detection of sources fainter than those they considered. This has come at the expense 
of computing efficiency; however, since the entire photon event database is on-line at 
Columbia, the complete search takes as little as 12 hours. 

Our procedure begins with the construction of two maps for each satellite orbit (called 
a HUT): one for the counts and one for the exposure time (which is initially constant across 
an image and is equal to the “good” exposure time for the HUT). We use data in the pulse- 
height invariant (PI) channels 2-10, which covers the energy range 0.16-3.5 keV, collected 
when the sun- angle viewing geometries were most favorable (i.e., VG = 1-3). We have 
opted to bin the IPC data in 32" x 32" pixels, rather than pixels four times as large, as was 
done by Wu et ad. (1991) and Hamilton et a I. (1991). This permits use of a detection cell 
which better samples the IPC point response function. The IPC fiat field (Wu et ad. 1991) 
is applied to the exposure map for each HUT, accounting for energy-dependent vignetting 
due to the Einstein optics and spatial nonuniformities in the IPC response. Next, the 
count maps for all HUTs in an observation are combined in sky coordinates, as are the 
exposure maps. 

In the next step, we compute the signal-to-noise ratio at each pixel in the image as 
though a source were centered there. Potential source counts ns are collected within 1.25 
arcmin of this central pixel and background counts are collected from within a concentric 
annulus with inner and outer radii of 3' and 6'. In practice, this means that there me 
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21 pixels in the source aperture, covering 5.97 arcmin 2 , and 304 pixels in the background 
annulus, covering 86.5 arcmin 2 (if both source and annulus are unshadowed). The number 
of background counts ns expected within the source aperture is the number of counts in 
the background annulus weighted by the relative areas of the source and annulus and the 
mean flat field values in the two regions. The signal-to-noise ratio is computed using the 
formula a = ns/ y/ns + ns- This procedure is carried out for every pixel in the IPC image. 

The search for sources occurs in an iterative fashion in which the signal-to-noise thresh- 
old for detection, <r th , is lowered with each successive iteration. We perform five iterations, 
with <7 t h = 10, 6, 4, 3, and 2. In any iteration, the “most significant” pixel, i.e., the one 
centered on the source with the highest signal-to-noise ratio a exceeding <x t h, is examined 
first for a source. Providing the source meets the sampling requirements — at least ~ 60% 
of the source pixels and ~ 30% of the annulus pixels are unshadowed and undeleted — the 
source is considered detected. After the source parameters are recorded, the pixel values for 
the source region are blanked in the count and exposure maps. The radius to which data 
are deleted is proportional to In a, an empirically determined value found to be suitable for 
typical-spectrum sources over a wide range of a (however, see §3.1 below). The procedure 
continues on to the next most significant source with cr > cr t h- If within an iteration the 
number of undeleted source or annulus pixels for a source decreases due to the excision of a 
brighter nearby source, the detection of this source is deferred to the next iteration. When 
no more sources with a above the current threshold are left, the signal-to-noise ratios are 
recomputed at each of the remaining pixels, excluding the deleted data, and <r th is lowered 
appropriately for the next search iteration. 

The advantages of the use of a flat field and a locally-determined background for the 
detection of faint IPC sources were discussed in detail by Hamilton et ai. (1991). We have 
made several minor modifications to their detection scheme, the most salient of which we 
s umm arize here. 
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• We search for sources in order of their brightness, not by scanning, as in the 
sliding-box methods often employed in automatic search procedures. This ensures 
that the significance of a source is not compromised by the presence of brighter 
sources in its background annulus and greatly reduces the number of “edge effect” 

2 a sources found near the edges of the IPC and along the window support ribs. 

• We iteratively decrease the signal-to- noise threshold for detection, a t h, in several 
stages, recomputing a at each undeleted pixel between iterations. Furthermore, 
we defer detection of sources “corrupted” by the deletion of a nearby brighter 
source to a subsequent search iteration. Again, these strategies improve the 
accuracy of our computed source significances. 

• We have increased the size of the background annulus and have made more 
stringent requirements for the minimum area used to compute the background 
level. This also reduces the number of spurious detections at the lower values of 
a. 

In concert, these modifications have improved the accuracy of our detections and have 
afforded us consistent, predictable behavior regarding the detection of the faintest sources, 
as we demonstrate in §2.2. 

A total of 49,537 sources are found in 2520 IPC images using the above procedure. 
Some sources in redundant images appear in the catalog more than once. In such cases 
we keep only the entry with the highest signal-to-noise ratio. After removal of duplicate 
sources and the spurious “satellite sources” identified in §3.1, 46,186 sources remain, 4764 
of which have significances greater than 3.5 a. This compares to 3912 > 3.5<r sources in the 
same 2520 images in the Einstein 2E Catalog, assembled by the Smithsonian Astrophysical 
Observatory (SAO). 

How many of the entries in the Two-Sigma Catalog are celestial X-ray sources? As a 
first attempt to answer this question, we have plotted in Figure 2 the average number of 
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sources detected per image as a function of exposure time. The number of sources (at all 
a ) rises steeply from ~ 3 per field at the shortest exposure times to ~ 25 per field at an 
exposure time of 5000 s, then increases more slowly to a maximum value of ~ 60 per field. 
In other words, there is an increase of a factor of eight over the first decade in exposure 
time, and a factor of two increase over the second. By contrast, the mean number of 
> 4 o sources per field, which should indicate how the relative number of real sources per 
field depends on exposure time, increases by a factor of ~ 10 over both decades. Spurious 
sources must be largely responsible for the distribution of the upper set of points in Figure 
2, but the exact nature of their contribution is not clear. In §3.2 we discuss statistical 
methods for understanding the spurious source contribution to the catalog. 

2.2. Validation of the Catalog 

The ultimate utility of the Two-Sigma Catalog depends entirely on the effectiveness 
of the source-search algorithm discussed above. We have performed two tests designed to 
validate our algorithm for the detection of both high- and low-significance sources. 

2.2.1. Comparison to EMSS Sources 

The EMSS contains 835 serendipitously discovered sources detected above 4 a in 1435 
high-latitude IPC images (Gioia et a 1. 1990). Being the largest single compilation of 
carefully selected Einstein sources assembled to date, the EMSS serves naturally as an 
important benchmark for extensions of the IPC database such as our catalog. Although 
the EMSS sources were extracted using procedures very different from ours (see Harnden 
et a 1. 1984), a comparison of the signal-to- noise ratios for our detections of EMSS sources 
with those published in the EMSS itself is an excellent means for examining the behavior 
of our source detection algorithm for high-significance sources. 

Figure 3 displays the results of that comparison. We have applied our source-detection 
program to the same 1435 images used to construct the EMSS. Immediately clear from 
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Figure 3 is that all sources fall very close to the identity line with little dispersion about 
that line. The dispersion appears to increase somewhat at lower values of a with 4 a EMSS 
sources having significances in our catalog between ~ 3a and 6a. Our versions of the 
EMSS sources actually tend to have somewhat higher signal-to-noise ratios, but only by 
an average amount of 0.15a. Thus, our automatic source-search procedure appears to work 
as well as, if not better than, detection schemes employed in the past for high-significance 
IPC sources. However, most of the sources in our catalog are below 4a, and the important 
test for our source detection algorithm concerns its behavior regarding weaker sources. 

2.2.2. Location of Sources on the IPC 

It is difficult to measure how well a procedure detects real sources below 4a because so 
many of the “sources” at that level of significance are in fact spurious. A useful diagnostic 
is provided by the distribution of source positions in the IPC detector coordinate system. 
While this distribution cannot provide a quantitative means for testing the efficacy of the 
source detection scheme, it provides a powerful qualitative one. For instance, we would 
expect the distribution of sources on the IPC to be approximately uniform with an excess 
at the center of the detector due to targets. In fact, the plot of the source distribution 
for an early version of our catalog showed strong enhancements and deficits of sources in 
regions parallel to the charge collection wires in the IPC and in the corners of the field. 
Furthermore, we found that the pattern changed as parameters such as the size of the 
background annulus were varied. 

Figure 4 (top panel) shows where sources in our final catalog are found in detector 
coordinates. The plot contains only sources detected in IPC images with single roll angles, 
since sources in fields with multiple roll angles have more than one set of corresponding 
detector coordinates. Thus, ~ 39,000 of the ~ 46,000 sources in the catalog are represented 
in Figure 4, and 93% of these are below 4a. The excess of sources at the center of the 
detector due to targets, and the lack of sources where data near the ribs have been deleted, 


- 12 - 


are obvious. Otherwise, it appears that the distribution of sources is roughly uniform. 
Some faint structure parallel to the charge collection wires remains present in the source 
distribution. But compare this source distribution to the IPC flat field, displayed in the 
bottom panel of Figure 4, which also shows structure parallel to the charge collection wires. 
The pattern of sources on the IPC mimics in detail the flat field. Darker regions on the 
flat field image represent areas where the detector is more sensitive. Thus, we are finding 
more sources at places where the detector is more sensitive, exactly what should happen 
if the excess sources are real. 

To test this assertion, we divided the Two-Sigma Catalog into two parts, separating 
sources detected where the flat field values are above the median value from those detected 
where the flat field values are below the median. Sixty-three percent of the sources are 
detected where the flat field is above the median value. Next, we cross-correlated each 
subcatalog with the Green Bank 6 cm radio catalog (this procedure is described in §4.1.1). 
Seventy-two percent of the X-ray sources that match 6 cm radio sources are from the 
“above-the-median” subcatalog, and the fraction of matches expected to arise by chance 
is significantly less for these (9% versus 12%). This tells us that the excess sources found 
where the detector is more sensitive are predominantly real sources. This result is an 
important confirmation of the flat-fielding procedure and a strong endorsement of our 
detection algorithm. In the next section we determine exactly what fraction of sources in 
the catalog, as a function of the signal-to-noise ratio, are real celestial X-ray sources. 

3. Statistical Properties of the Two-Sigma Catalog 

Having established that our methods for constructing the IPC Two-Sigma Catalog 
are qualitatively well-behaved, quantitative analysis is needed to determine the usefulness 
of the catalog and its limitations. In this section we explore a way of finding and removing 
spurious sources that are artifacts of our source detection scheme. We then model the 
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number of real X-ray sources expected to be present in the catalog, thereby deter mining 
the spurious source detection rate as a function of the signal-to-noise ratio and location 
on the detector. 


3.1. Removal of “Satellite” Sources 

We are confident that searching for sources in order of significance, rather than scan- 
ning for them, makes the most sense when a local background is used and when sources 
are deleted after detection. However, in the implementation of our detection algorithm 
we have had to assume that all sources we detect are point sources and that they have 
approximately the same spectrum, neither of which is strictly true. In certain instances, 
these assumptions lead to insufficient data being deleted after the detection of a source. 
For bright, extended sources the reasons are obvious; this is why we were careful to exclude 
images containing extended sources and diffuse emission. But similar problems can arise 
for some point sources as well. The point response function (PRF) for the IPC is charac- 
terized by two components: (a) a quasi-Gaussian core, the exact shape and width of which 
are determined by the geometric figure of the optics and the event-centroiding capabilities 
of the IPC, and (b) non-Gaussian wings, due to X-rays being scattered to wide angles by 
mir ror microroughness. Both of these components have a strong spectral dependence. For 
soft X-rays, the core of the PRF is broader than it is for hard X-rays because soft X-rays 
deposit less charge on the detector wires, making the determination of event centroids less 
accurate. Mirror scattering, on the other hand, is much more severe for hard X-rays, and 
a higher fraction of them are scattered out of the core and into the wings. Thus, especially 
soft- or hard-spectrum sources (e.g.. stars or some AGNs) will tend to have a non-negligible 
fraction of counts outside the deletion radius which has been set for sources with typical 
spectra. These undeleted counts might then be detected as independent “satellite” sources 
in a subsequent iteration of our detection procedure. 


-14- 


To test for this effect, we cross-correlated the Two-Sigma Catalog with itself to see 
if sources tend to be found close to bright sources above the level expected by chance. 
For a given “primary” source of significance cti, only matches to it with sources having 
a < <7i in the same image were considered. The accumulated results for the number 
of matches to primary sources as a function of positional offset are displayed in Figure 
5. The primary sources are grouped by <7i, and the number in each offset bin has been 
normalized by the area of the annulus represented by the radial range of that bin. The 
resulting distribution is constant at each offset if the two sets of objects being compared 
are positionally uncorrelated, making the identification of a correlation straightforward. 

Each panel in Figure 5 has qualitatively the same characteristics. The n um ber of 
matches is zero out to the deletion radius determined by cr x and is approximately constant 
at large position offsets, as expected. However, at offsets just beyond the deletion radius, 
excesses of sources are observed. These are satellite sources and should be deleted from the 
catalog. Of coruse, there are also real celestial X-ray sources at small offsets as well, but 
the fraction of these determined from the observed source density at a given a is low (19- 
29%). Instead of removing every source within a certain distance of a bright X-ray source, 
we only remove those below 5a to recover some of the real sources at small offsets. This 
procedure flags 1,729 entries in our catalog, the removal of which leaves 46,186 sources in 
the final Einstein Two-Sigma Catalog. It is interesting that a strong correlation is observed 
even in the 4-6 <7 range for primary sources. We suspect that the satellite sources in these 
cases are largely due to faint but extended primary sources rather than point sources with 
unusually hard or soft spectra, since the magnitude of the number of undeleted counts 
in the latter case is expected to be small. This could be confirmed by the new search 
for extended sources in the IPC database being conducted by Gaidos, Oppenheimer, & 
Helfand (1995). 
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3.2. The N(> a)-a Relation 

With the removal of the satellite sources, the Two-Sigma Catalog is now largely free of 
sources induced by our detection procedure. But before we use the catalog for astronomical 
purposes, we would like to gauge its reliability. A potentially useful diagnostic is the 
median signal-to-noise ratio <r m , which can be computed for various ranges of exposure 
times and/or positions on the IPC. Exposure times for the IPC images included in our 
study vary by more than two orders of magnitude, and in a single image, the effective 
exposure time varies by roughly a factor of three between the center and the edge of 
the field of view. Luckily, the X-ray log N(> S)-log S has an approximately constant 
slope over the range of limiting sensitivities represented in the data (Hasinger et a 1. 1993). 
Therefore, cr m for real X-ray sources in our catalog should be independent of exposure time 
and location on the IPC. In addition, spurious sources— the fraction of which increases as 
cr decreases — should not show any preference for location on the IPC; their presence will 
decrease a m but should not alter its spatial constancy. However, as displayed in Figure 
6, a m for sources in our catalog decreases with off-axis angle. This must mean that, at a 
given value of cr, the fraction of spurious sources increases with off-axis angle. Thus, we 
have an indication that the reliability of the catalog is a function of source location on the 
detector. 

Being optimists, we have decided to model the fraction of real — rather than the 
spurious — sources in the Two-Sigma Catalog. This begins with the assumption that the 
soft X-ray log N(> 5) -log 5, expressed 

N(> S ) = N 0 S~ a , (1) 

has a constant slope a, which has been measured to be 1.5 using the EMSS (Gioia et a i. 
1990) and 1.7 at somewhat lower fluxes using ROSAT (Hasinger et a 1. 1993). We adopt 
a = 1.5 in our calculations below. Next, we assume that the spectra of the X-ray sources 
are similar and, therefore, that a direct proportionality exists between the count rate for 
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an X-ray source and its energy flux. The counts-to-flux conversion customarily adopted 
for Einstein IPC sources is 1 ct s -1 = 2.4 x 10 -11 ergs cm -2 s -1 . This corresponds to the 
assumption of a power-law spectrum with T = 2 and an absorbing column Nyi = 2 x 10 20 
cm ~ 2 . From the definition of the signal-to-noise ratio a = St* /y/S + B; where S and B 
are the source and background counting rates and t is the exposure time, we obtain an 
expression for S in terms of a: 


S = 


a 2 -(- y/a 4 + 4:Btcr 2 
2 1 


( 2 ) 


Substituting this into equation (1), the N(> a) relation is derived: 


N(> a) = Nq 


a 2 + y/a 4 + 4 Bta 2 
2 1 


( 3 ) 


Ultimately, we wish to compare directly the number of sources expected in a particular a 
interval with the actual number present in the Two-Sigma Catalog; therefore, it is not the 
integrated number of sources above some a that we are after, but rather the differential 
source counts n(a), given by 

OO 

n(cr') da' . (4) 

a 

Thus, differentiating equation (3) and applying the appropriate limits, we obtain the ex- 
pression for the differential source counts: 



n{a) = - 


dN(> a) 
da 


1 — a— 1 


= aNo(2t) a [ a 2 + (a 4 + ABta 2 ) 2 
x \%a + i(<7 4 + 4 Bta 2 )~% (4 a 3 + 8I?fcr)j . 


( 5 ) 


We have used IPC. images with a wide range of exposure times; in addition, since vi- 
gnetting affects some sources of background but not others, the background characteristics 
vary across the face of the detector. Thus, equation (5) must be modified to accommodate 
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these circumstances. To deal with exposures of different duration, we compute n(a) for 
exposure times t spaced at 500s intervals over their entire range, weight the calculations by 
a normalized distribution function fit), and sum the results. The function f(t) is derived 
from the image exposure times displayed in Figure 1. In addition, vignetting due to the 
Einstein optics substantially reduces the effective exposure time at off-axis field angles 9, 
so exposure times must be modified by the normalized vignetting function v(9) (see Wu 
et a 1. 1991). 

The adjustment for spatial variations in the background is more complicated. Both 
source counts and background counts due to the X-ray background and solar-scattered 
X-rays are vignetted, but the particle background is not (Wu et a 1. 1991). Hence, the 
background rate B consists of two parts: Bx for the vignetted X-ray background, and Bp 
for the particle background. Bp was determined in two independent ways by Wu et a l. 
(1991), who derived a mean value of ~ 1.4 x 10 -4 ct s -1 arcmin -2 . The total background 
rates for source-subtracted IPC images are strongly clustered near 1.6 ct s _1 (Fig. 7); 
thus, the mean photon background rate is ~ 4.5 x 10 -4 ct s -1 arcmin -2 . Assigning B\ 
the appropriate on-axis value of 7.9 x 10 -4 ct s -1 arcmin -2 , Bxv{9 ) summed over 9 and 
weighted by area gives the above mean background rate for photons. 

Finally, we would like to compute n(cr) in different radial zones on the IPC. Different 
normalization constants are therefore necessary for the different zones, and we replace the 
constant No with C(9). Thus, equation (5) transforms to the following: 


n(cr, 9) = ^2 OiC{Q)f{t)[2tv(d)] a \a 2 + [cr 4 -I- 4 (Bxv{9) -I- Bp)tcr 2 ]* 


— a — 1 


t L 


X 


2 a + i[cr 4 -I- 4 (B\v(0) + Bp)ta 2 } * [4<r 3 + 8 (B\v(6) + Bp)tcr] 
z 


( 6 ) 


Plots of n(<7, 9) together with the signal- to-noise distribution for sources in the Two- 
Sigma Catalog are displayed in Figure 8 for 5 arcmin-wide annuli centered on the optical 
axis of the IPC. We have omitted in Figure 8 3,350 sources detected in fields with total 
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background counting rates in excess of 2.3 ct s -1 since we have assumed a constant value 
of 1.6 ct s _1 in the calculations. The normalization constants C{Q) were determined by 
numerically integrating equation (6) over the limits [5,oo] for each annulus and setting the 
result equal to the actual number of cataloged sources above ba in that annulus. 

At off-axis angles greater than 5', the signal-to- noise distributions of sources in Figure 
8 are fit very well by the model above 4 a in each radial range. Targets of the IPC observa- 
tions contaminate the distribution in the first panel of Figure 8, causing the normalization 
determined by the number of > ba sources to be too high. However, this provides added 
confidence that the model fits off-axis, where sources were detected serendipitously, are 
accurate. Between 3.5 and 4 a an obvious break occurs in the source distributions. At the 
lower values of a, it is clear that the departure of the data from the model, for a given 
a, increases as the off-axis angle is increased. Thus, our inference above that the fraction 
of spurious sources increases with 6 is confirmed. The break occurs right near the detec- 
tion thresholds used in the EMSS and Deep Survey reanalysis, adding support to those 
investigators’ choices for the signal- to-noise threshold a t h- 

The results of the modelling indicate that 28%, or ~ 13,000, of the entries in the 
Two-Sigma Catalog are celestial X-ray sources, assuming all the sources above 4<j are real. 
Below 4a, this fraction is 22%. We are not bound by our initial choice of 2cr for the 
catalog’s signal-to-noise threshold, and our estimates of the reliability of the catalog as a 
function of both a and 6 permit us to apply cr-thresholds which vary with off-axis angle. 
This is certain to improve the reliability of any results obtained with the catalog. We 
could, for example, require that at a given off-axis position at least 20% of the sources at 
every value of a be real and adjust a t h accordingly. Thus, we would set <r th = 2.0 at 9 < 
15 arcmin, cr t h = 2.3 for 15 < 9 < 20 arcmin, cr t h = 2.5 for 20 < 9 < 25 arcmin, and so on. 
With such adjustments to <r t h, the number of real sources in the catalog drops only to ~ 
10,500, whereas the fraction of sources which are real climbs to 45%. Adjustments of this 
sort can be used in combination with other methods for selecting out the celestial X-ray 
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sources in the catalog; we describe the use of filters to enhance substantially the number 
of real sources in a subsample in the following section. 

4. Applications: Selection and Study of the “Real” 

Two-Sigma IPC Sources 

Having characterized in detail the Two-Sigma Catalog, we can now use it for astro- 
nomical purposes. The primary challenge lies in the determination of which 28% of the 
entries in the catalog are the real celestial X-ray sources. The catalog itself is not reliable 
enough to make the direct study of sources at the 2cr level possible. We could, therefore, 
limit ourselves to sources detected above 4cr to avoid spurious sources, as in the EMSS; 
this would provide us with ~ 1400 new X-ray sources, after omission of the targets and 
previously identified EMSS sources. However, all such sources will have fluxes at or above 
the EMSS sensitivity threshold, and we would have the arduous task of seeking their 
optical counterparts in the rather large ~ 1' radius IPC position error circles. An alter- 
native approach is to apply filters (i.e., astronomical catalogs at other wavelengths) to the 
Two-Sigma Catalog in order to isolate the real celestial X-ray sources from the statistical 
fluctuations. The advantages of this technique are that, depending on the choice of filter, 
one can explore fainter X-ray sources of types not well represented in other surveys and 
efficiently locate their optical counterparts. The disadvantage of this procedure is that 
a certain amount of contamination due to chance coincidences and false X-ray sources is 
admitted, but the level of contamination is easily determined. Since it took over ten years 
to complete 96% of the optical identifications of the 835 EMSS sources, we have opted for 
the second approach. As will be discussed below, large samples of X-ray sources which are 
~ 90% reliable can be obtained this way, even though the catalog itself contains only 28% 
real sources. The filters we have used are catalogs from surveys of the radio and infrared 
sky. 
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The principle motivation for studying new populations of faint X-ray sources is to 
search for new components of the cosmic X-ray background (XRB). The XRB is now 
thought to arise from the integrated emission of faint discrete X-ray sources (Mather et a 1. 
1990). However, known classes of l umin ous extragalactic X-ray sources, such as quasars, 
Seyfert galaxies, and galaxy clusters, have failed to demonstrate the spectral and spatial 
properties required to account for the origin of the XRB (Fabian & Barcons 1992). Two 
promising candidates at this time are star- forming galaxies ( e.g ., Griffiths & Padovani 
1990) and obscured Seyfert galaxies (e.g., Madau, Ghisellini, & Fabian 1994; Comastri et 
a 1. 1995). An excellent means for isolating both of these types of galaxies is by infrared 
selection. Therefore, we have cross- correlated the IPC Two-Sigma Catalog with the IRAS 
Faint Source Catalog (FSC). In addition, although radio-loud objects are not expected to 
make a major contribution to the XRB, we have cross-correlated the Two-Sigma Catalog 
with the Green Bank 6 cm Northern Sky Catalog (Becker, White, & Edwards 1991) and 
the Texas 80 cm interferometric catalog (Douglas, private communication). As we discuss 
below, fascinating X-ray sources not represented in the EMSS have been found in both the 
radio and infrared samples. 


4.1. Radio Selection 

Correlations between radio and X-ray emission have been observed spanning several 
orders of magnitude in luminosity for a wide variety of objects, including elliptical and SO 
galaxies (Fabbiano, Gioia, & Trinchieri 1989), 3CR radio galaxies (Fabbiano et aJ. 1984), 
and quasars (Worrall et a 1. 1987). The correlations are usually strongest for the nuclear 
(or “core” ) radio power and the soft X-ray luminosity. Complete radio information exists 
for the EMSS AGNs, which has been used to compare the properties of radio-loud and 
radio-quiet AGNs (Della Ceca et a 1. 1994). Hamilton & Helfand (1993) found a significant 
correlation between sub-mJy radio sources and > 2a fluctuations in deep Einstein IPC. 
images, concluding that star-forming galaxies, which make up the majority of sources at 
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these radio fluxes, could contribute significantly to the soft X-ray background. However, 
Boyle et al. (1993), in a similar study involving a deep ROSAT image, found only a marginal 
correlation between X-ray and radio sources. This debate, plus the finding by Brinkmann, 
Siebert, & Boiler (1994) that a considerable number of Molonglo radio sources contained 
in the ROSAT All-Sky Survey sources are radio- and X-ray-bright but optically faint, has 
motivated us to examine the properties of radio-selected sources in the Two-Sigma Catalog. 
We have utilized high- and low-frequency samples of radio sources. An advantage of using 
radio selection for X-ray sources is that radio positions can be determined to arcsecond 
precision, which greatly simplifies the identification of their optical counterparts. 

4.1.1. Cross-correlation Statistics and Followup VLA Observations 

The Green Bank 6 cm catalog derived by Condon, Broderick, & Seielstad (1989) was 
compiled by Becker et al. (1991). The catalog contains 53,522 sources at declinations in the 
range 0° < <5 < 75°, with a flux limit of 40 mjy at the equator and 20 mJy at the highest 
declinations. The positions of the 6 cm sources are accurate to 50" (2 o). Positions of 
sources in the unpublished Texas 80 cm catalog (Douglas, private communication) can be 
accurate to ~ 5", but are sometimes ambiguous because of the interferometer’s sidelobes. 
The flux limit of the Texas catalog, which contains 67,751 sources, is 250 mjy and the 
catalog covers the declination range —25° < 5 < 72°. Texas sources are assigned quality 
flags which can be used to select those sources with the most accurately measured positions 
and fluxes. 

The results of the cross-correlations between the Einstein Two-Sigma Catalog and the 
6 cm and 80 cm radio catalogs are displayed in Figures 9a and 10a, respectively. The 
number of matches as a function of the X-ray/radio position offset, normalized by the area 
of the ann ulus represented in each offset bin, are plotted. The reason for plotting the offset 
distribution in this manner is that two uncorrelated catalogs will have a constant area- 
normalized number in each offset bin; correlations are revealed by peaks above this constant 
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level at small offsets, and the constant level establishes directly the chance coincidence 
rate. In both Figures 9a and 10a strong correlations are observed. To verify that the 
constant level in each plot accurately represents the chance coincidence rate, we shifted the 
positions of the radio catalog sources by some arbitrary amount, say 15', and reperformed 
the matches. This simulates the results obtained for two uncorrelated catalogs, and, as 
indicated by the dashed lines in Figures 9 and 10, confirms the chance coincidence level in 
each graph. 

The chance coincidence rate is known at every position offset. Therefore, our sample 
of radio-selected X-ray sources can be defined on the basis of a desired level of reliability 
by accepting only those matches with offsets smaller than some limit. If we choose our 
offset limit to be 60" in both cross-correlations, we find that 598 X-ray sources match 6 
cm radio sources with an 87% “real rate” (i.e., 13% are chance coincidences), and that 
589 X-ray sources match 80 cm radio sources at a real rate of 85%. To illustrate the 
usefulness of applying a position-dependent signal-to-noise threshold er th (described above 
in §3.2), if we require that at least 20% of the X-ray sources at every a and off-axis angle 
be real, we remove more than 23,000 sources from the Two-Sigma Catalog, only 2,500 
of which are expected to be real. Upon reperforming the cross-correlation, the number 
of matches of radio sources to X-ray sources falls to 516 and 502 for 6 cm and 80 cm 
catalogs, respectively, for X-ray /radio position offsets of 60" or less. However, as indicated 
by Figures 95 and 106, the chance coincidence level drops significantly: 92% of the 6 cm 
coincidences are expected to be real, and 91% of the 80 cm coincidences are expected to be 
real. The chance coincidence level in each cross-correlation is determined by the relative 
surface densities of sources in the catalogs being compared. Thus, the removal of 50% of 
the X-ray sources in the Two-Sigma Catalog, most of which are spurious, is the reason the 
chance coincidence rates have decreased so dramatically. 

Snapshots of a large fraction of the radio sources (from both samples) in the RA 
range 0 h -13 h were made using the VLA in A array at 20 cm. We obtained additional 
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snapshot observations for sources at other hour angles during short interv als when the 
VLA configuration was being changed. The snapshots, which were typically of 1 min 
duration with limiting sensitivities well below those of either the Green Bank or Texas 
surveys, provided us with radio source positions accurate to ~ l". These positions were 
then used to make optical finding charts from the digitized POSS plates using the GASP 
system at STScI. Additional finding charts were made using the positions of 357 Texas 80 
cm sources (not observed with the VLA) with the highest quality position and flux flags. 
Approximately half of the finding charts show optical counterparts above the POSS plate 
limit (m < 20), many of which are bright enough to observe spectroscopically using 2-3m 
class telescopes. After checking those with optical counterparts for prior classification, we 
proceeded to the telescope to classify previously unidentified X-ray /radio sources. 

4.1.2. Optical Spectroscopy of Unidentified IPC/Radio Sources 

Optical spectroscopy of radio-selected Einstein sources was conducted primarily in 
three observing runs. In 1991 November, using the 3m Shane telescope at Lick Observatory, 
we obtained spectra in the wavelength range 3800-8100 A with 7 A (FWHM) resolution. 
The second two runs, using the KPNO 2.1m telescope and Goldcam spectrograph in 1992 
July and August, provided spectra over the 3700-7400 A range at 4.5 A resolution. A few 
other spectra were obtained with the Lick 3m and the KPNO 4m telescopes during rims 
dedicated to other programs, with spectrograph set-ups similar to those described above. 
Forty spectra were compiled in all. 

The X-ray, radio, and optical data for the IPC/radio sources we observed are listed 
in Table 1. The radio source catalog names (relating to their B1950.0 coordinates), the 
redshifts z, and classifications determined from our spectra are given in the first three 
columns. To clarify a few of the classifications used, broad- and narrow-line radio galaxies 
are denoted BLRG or NLRG, Sey and LINER are used for Seyfert and Low Ionization 
Nuclear Emission-Line Region galaxies, E denotes an early-type galaxy spectrum devoid 
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of strong emission lines, and Cl/BL is used for the one object which could be a cluster or 
BL Lac object. The 6 cm, 20 cm, and 80 cm radio flux densities (in mJy) appear next, 
followed by the 5 GHz radio power (in W Hz -1 ) for each object. Radio fluxes at 20 cm 
are taken from the Green Bank 20 cm Northern Sky Catalog of White & Becker (1992). 
For objects with 80 cm flux densities only, Se was estimated by extrapolating from the 80 
cm measurement assuming a power-law spectrum with index a = 0.7. The Galactic H I 
column (in units of 10 20 cm -2 ) in the direction of each object (Stark et a 1. 1992), the IPC 
sequence number, the signal-to-noise ratio of the X-ray detection, and the IPC count rate 
are provided in the next four columns of Table 1. The X-ray fluxes (xlO -13 ergs cm -2 s -1 ) 
in the 0.16-3.5 keV IPC band which follow assume a 1 keV thermal spectrum for normal 
galaxies or a power law with photon index T = 2 for AGNs, in addition to the tabulated 
Galactic Nn value. The X-ray luminosity Lx (in ergs s -1 ) was computed assuming H 0 = 
50 km s -1 Mpc -1 and qo — 0. The radio/X-ray position offsets (in arcsec) are listed last. 

The classification of our quasars was straightforward. Classifications of other emission- 
line spectra were made on the basis of the emission-fine flux ratios and velocity widths 
(Veilleux & Osterbrock 1987), incorporating the radio luminosity information. Seyfert 
galaxies usually have radio powers at 20 cm below a few 10 24 W Hz -1 (Ulvestad & Wilson 
1989); therefore, we adopt a limit of 10 24 W Hz -1 at 5 GHz to distinguish radio galaxies 
from the lower radio power Seyferts and LINERs. BL Lac objects were classified using the 
criterion established by Stocke et a I. (1991) that the Ca ii break “contrast” be less than 
25%. For one BL Lac, the 18cr X-ray source 0458+6530, we could not determine a redshift. 
None of the early-type galaxies in our list appear to be members of galaxy clusters on the 
POSS images, including the one (1201+2814) supposedly situated in the cluster Abell 1451 
(which previously did not have a published redshift) . We discuss below the possibility that 
some of our early-type galaxies are actually in clusters. 

Table 1 indicates that most of the objects we observed are exactly what we would 
expect to find in a radio- and X-ray-selected sample: quasars (17), broad- and narrow-line 


-25- 


radio galaxies (4), BL Lac objects (3), lower luminosity AGNs (4), and radio-loud early- 
type galaxies (10) (see Tananbaum et a 1. 1979; Fabbiano et a 1. 1984; Morris et a 1. 1991; 
Fabbiano et a 1. 1989). In two cases, the optical object closest to the radio position turned 
out to be a star; we did not search the ~ 1' radius IPC error circle for more plausible 
counterparts to the X-ray sources in these instances. One early-type galaxy (1832+6845 
at z = 0.205) is tentatively classified as a cluster/BL Lac object. It has an elliptical galaxy 
continuum with an extremely strong [O II] A3727 line and only weak [O ill] A5007 and 
H/3 emission, characteristics which typify galaxies in cooling flow clusters (see Allen et a 1. 
1992). An associated cluster, however, is not apparent on the POSS finding chart. 

A closer look at Table 1 reveals two surprises: (a) the presence of high redshift (z > 3) 
quasars, which are absent in the EMSS, and (b) the large number of early-type galaxies with 
little or no evidence for emission lines in their spectra yet X-ray luminosities comparable 
to those of broad-line AGNs (i.e., L\ > 10 42 ergs s _1 ). We discuss these two interesting 
subgroups further here. 

High Redshift Quasars. X-ray studies of high redshift quasars provide information 
regarding the environs, evolution, and inherent properties of the most luminous AGNs 
(Elvis et a 1. 1994; Bechtold et a 1. 1994b; Pickering, Impey, & Foltz 1994). At this time, 
however, only ~ 20 X-ray-detected quasars with z > 3 are known; thus, each new example 
adds an important datum for such investigations. One of the first objects observed as 
part of this project was 1745+6228, a quasar at z = 3.87 (Becker, Helfand, & White 
1992). Since then, we have observed two other high-z quasars with redshifts of 3.04 and 
4.30. Thus, 3/40 of the radio-selected Einstein sources we have observed are high redshift 
quasars. By contrast, the highest redshift object of the 835 EMSS sources is a z = 2.83 
quasar. Our use of radio selection may have contributed to this striking difference. On 
the other hand, our examination of the data at fainter flux levels and our development of 
a more sensitive detection procedure must also be partly responsible. 
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The z = 4.30 quasar 1508+5714 is the second most distant X-ray source now known, 
just eclipsed by a recently discovered quasar at z = 4.32 (Henry et a 1. 1994). It was 
detected in IPC sequence number 19143 as a 6.1 a source, and we obtained its spectrum 
in 1994 May at the KPNO 4m (see Fig. 11). 1508+5714 was discovered independently by 
Hook et a 1. (1995), who selected it solely on the basis of its radio properties. The reason 
the quasar is not already a known X-ray source is a mystery. The source is clearly visible 
(three contours above background) in the image of 19143 published in SAO’s Einstein 
Observatory Catalog of IPC X-ray Sources (Harris et a l. 1990) but apparently fails to 
exceed the signal- to-noise limi t of 3.5a used in that catalog. We can only surmise that our 
use of a locally determined background and application of the IPC flat field has afforded 
us a more accurate measurement of the background level in the image and thus a better 
detection. 

We measure a background-subtracted IPC count rate of 0.027 ct s -1 for 1508+5714, 
which makes it the fourth highest flux X-ray quasar with z > 3, and the brightest (and 
most luminous) X-ray quasar above a redshift of 3.4. In Table 2 we have assembled what 
we believe to be a complete listing of all published X-ray-detected quasars with redshifts 
in excess of 3. This includes our z = 3.04 quasar 1126+2342. Because many of the quasars 
in Table 2 have been observed with ROSAT, we have listed PSPC count rates. For those 
quasars with IPC detections only, we have estimated the PSPC count rate by multiplying 
the IPC count rate by a factor of two, which is appropriate for most X-ray sources (see 
the ROSAT Mission Description, Appendix F). Note that 1508+5714 is 50 times brighter 
than the z = 4.32 quasar and thus provides the only feasible opportunity to examine in 
detail quasar X-ray emission at z > 4. We have recently carried out an ASCA observation 
of 1508+5714 to study its X-ray properties in the 2-60 keV (rest) range. 

There is a putative dearth of radio-loud quasars at high redshifts which has prompted 
the suggestion that radio-loud and radio-quiet quasars have evolved differently (Schmidt et 
aJ. 1995). The three high-z quasars in our sample are, by definition, all radio- loud; in fact, 
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1508+5714 and 1745+6224 are currently the highest redshift radio-loud quasars known. It 
would appear, based on the frequency of such quasars in our list, that X-ray selection is 
an efficient means for finding high -2 radio-loud quasars. 

High-LxNormal Early-type Galaxies. One of the surprises uncovered during the EMSS 
optical identification process was severed normal elliptical galaxies with X-ray luminosities 
Lx~ 10 43 ergss -1 (Stocke et a 1. 1991), an order of magnitude more luminous than early- 
type galaxies surveyed previously with Einstein (Fabbiano et a 1. 1989). Radio- and X-ray- 
loud, but optically undistinguished, galaxies are also present amongst Molonglo 408 MHz 
Survey radio sources detected in the ROSAT All-Sky Survey (Brinkmann et a 1. 1994). 
We have found several similar “optically passive” X-ray galaxies: eight of the 40 objects 
listed in Table 1 have normal early-type galaxy classifications and X-ray luminosities which 
range from 10 43 -10 44 ' 5 ergs s -1 in the 0.16-3.5 keV band, typical of broad-line AGNs. Our 
galaxies differ from those contained in the EMSS in that they are all fairly radio-loud 
( PsGHz > 10 24 W Hz -1 ), whereas the EMSS galaxies, with lower average redshifts than 
ours, are mostly undetected at 6 cm (S$ < 1 mJy). However, this difference may simply 
be an artifact of our radio selection and the X-ray emission mechanisms may be the same 
in the two sets of galaxies. 

The most obvious possibilities for the nature of these X-ray-luminous early-type galax- 
ies is that they are either optically weak BL Lac objects, in which starlight dominates the 
continuum, or radio galaxies situated in X-ray-emitting clusters. As discussed above, we 
have already examined the Ca II break contrast in the spectra of these objects. The eight 
galaxies in question here all have break contrasts typical for normal early-type galaxies, 
but this does not rule out some BL Lac activity. Therefore, we obtained deep VLA and 
CCD images for five of the galaxies to test further the BL Lac and cluster hypotheses. The 
CCD images indicate that while none of these galaxies is in a rich cluster, all have nearby 
galaxian companions at small projected separations. Similarly, the VLA images reveal 
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that none of the galaxies have the extended, distorted radio tails common to radio galax- 
ies in the cluster environment. On the other hand, the radio emission is not dominated 
by an unresolved, compact core in any of the galaxies, which convincingly rules out BL 
Lac activity (Kollgaard et a 1. 1992). Instead, the radio morphologies of these galaxies are 
amorphous or double-lobed with signs, in two cases, for a weak core component. The radio 
emission appears to be confined to the optical extent of the galaxy. Figure 12 displays the 
optical spectrum, optical R band image, and VLA 20 cm radio image of 2324+4222, an 
optically passive X-ray galaxy at a redshift of 0.068. The spectrum of 2324+4222 is clearly 
devoid of emission lines. The compact-double radio structure has a size of ~ 15 arcsec 
and is contained well within the optical extent of the galaxy. Several galaxies are located 
within 3 arcmin (~ 350 kpc) of the bright elliptical. 

There are two promising alternatives for the origin of the X-rays in these objects. First, 
some sort of nonstandard AGN may be present. For example, the radio galaxy 3C 264 was 
observed to have a nuclear point-like X-ray source in an Einstein HRI observation, yet its 
optical spectrum is remarkably dull: a starlight-dominated early-type galaxy continuum 
and only the slightest hint of Ha + [N II] emission lines (Elvis et a 1. 1981). The radio 
and X-ray luminosities of 3C 264, as well as its amorphous radio structure (Baum et 
a 1. 1988), are comparable to those of our objects. 3C 264 might therefore represent an 
AGN prototype for our optically passive X-ray galaxies. Alternatively, the X-ray emission 
in these objects could arise in a hot, diffuse medium associated with a small group of 
galaxies. Such emission has been recently discovered with ROSAT and, as noted above, 
all the galaxies for which we have CCD images have nearby companions. As an example, 
the X-ray emission from the galaxy group HCG 62 (Ponman Sc Bertram 1993) extends 
over at least 360 kpc and has a luminosity of 1 x 10 43 ergss -1 in the ROSAT band. We 
have marginal evidence that the closest of our galaxies, 2324+4222 (shown in Fig. 12), 
is extended in the IPC image: the signal-to-noise ratio of the detection of this source 
increases steadily as the aperture within which photons are collected is increased from one 
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( S/N = 3.71 cr) to two ( S/N — 4.05<r) to three ( S/N = 4.42er) times the size of an IPC 
point source. ROSAT HRI observations, scheduled for three of the brightest sources, will 
determine whether the X-ray emission is point-like or extended in these objects. These 
results, along with our radio and optical images, will be published elsewhere (Moran, 
Helfand, & Becker 1995) 


4.2. Infrared Selection 

Infrared galaxies have also been studied extensively at X-ray wavelengths, frequently 
in the context of the XRB. As mentioned above, two promising candidates for the origin 
of the XRB are obscured Seyferts and star-forming galaxies, both of which are expected 
to turn up in IR-selected samples. In a search for obscured Seyfert galaxies, IRAS Point 
Source Catalog (PSC) galaxies within HEAO-1 error boxes were studied by Kirhakos & 
Steiner (1990), who found a large number of type 2 Seyfert galaxies that they interpreted 
to be obscured Seyfert l’s on the basis of their hard X-ray properties. However, the 
identification of these IRAS galaxies as X-ray sources is very uncertain due to the large 
size (1° x 3°) of the HE AO - 1 error boxes. In the soft X-ray regime, the Einstein IPC and 
IRAS PSC databases have been used to establish correlations between soft X-ray and far- 
infrared luminosities for normal and star-forming galaxies (David, Jones, & Forman 1992) 
as well as for broad-line AGNs (Green, Anderson, &c Ward 1992). Optical spectroscopy 
of 11 IPC/PSC sources by Fruscione & Griffiths (1991) further established the presence 
of starburst galaxies in X-ray /IR-selected samples and provided preliminary support to 
the hypothesis of Griffiths & Padovani (1990) that starburst galaxies might contribute 
significantly to the XRB. 

The contribution to the intensity of the soft XRB by IR-bright galaxies has been 
estimated in various ways by the above authors to fall between 5% and 40%. Boiler et a 1. 
(1992) cataloged 241 IRAS PSC sources detected in the ROSAT All-Sky Survey, of which 
20 appeared to be “normal” spiral galaxies and starbursts with extreme X-ray luminosities 
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(i.e., Lx> 10 42 ergss -1 ). These, too, could have interesting implications for the XRB 
and we have examined the Boiler et a 1. sample in detail (Moran, Halpern, & Helfand 
1994; Moran, Halpern, & Helfand 1995). To complement these studies of X-ray-bright 
infrared galaxies, we have matched our Einstein Two-Sigma Catalog with the IRAS Faint 
Source Catalog (FSC). Both of these catalogs are substantially deeper than the source lists 
employed in previous works. Most IRAS galaxies are optically bright and axe visible on the 
Palomax plates; thus, given the size and position angle of the IRAS position error ellipse, it 
is usually straightforward to determine the correct optical counterpart even though IRAS 
positions can have substantial uncertainties (~ 1'). 

4.2.1. Cross-correlation statistics 

The IRAS FSC (Moshir et a 1. 1992) contains 173,044 point sources detected in at 
least one of the spectral bands centered at 12, 25, and 60/im and is a factor of ~ 2.5 more 
sensitive than the PSC. The FSC covers Galactic latitudes | b |> 10° at 12 and 25/zm and 
| b |> 20° at 60/im. As described by Moshir et a 1. (1992), sources likely to be galaxies or 
stars can be selected on the basis of their infrared colors. Thus, to choose the “galaxies” , 
we selected the FSC sources detected with high flux-density quality at 60/«n for which 
Si 2 /x < Seen- This FSC subsample consists of 60,944 sources at least 20° from the Galactic 
plane. 

Cross-correlation of the IPC Two-Sigma Catalog with the FSC “galaxies” subsample 
was performed in the same manner as that described above in §4.1.1. The results are 
illustrated in Figure 13a for the full Two-Sigma Catalog. Again, a very strong correlation 
is observed at small offsets and is particularly significant for offsets < 40". The chance 
coincidence rate is slightly larger in this cross-correlation than it was for those with the 
radio catalogs. A total of 371 IRAS sources match IPC sources at offsets < 60", but at a 
true coincidence rate of just 76%. At offsets < 40", there are only 254 matches, but the 
fraction of true coincidences climbs to 86%. The application of an off-axis angle-dependent 
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significance threshold <r th decreases the chance coincidence rates considerably. As above, 
we required that at each off-axis angle at least 20% of the sources in any cr-interval not 
be spurious and reperformed the cross-correlation, the results of which are displayed in 
Figure 136. At offsets < 60", there are now only 309 total matches, but the fraction of 
true coincidences has increased to 83%. At offsets < 40", 90% of the 222 coincidences 
are expected to be real. Optical finding charts were made for the full sample of 371 
IRAS/Einstein sources, again with the GASP system at STScI, using the IRAS source 
positions. The position error ellipses associated with each source were used to determine 
the likely optical counterparts to the IRAS sources. 

4.2.2. Optical Spectroscopy of UnidentiSed IPC/FSC Sources 

Optical spectra of unidentified IPC/FSC sources were obtained primarily in two ob- 
serving runs at the Lick 3m in 1993 July and December using the Kast Double Spectro- 
graph. These spectra cover the wavelength range 3400-8100 A at 5 A (FWHM) resolution. 
Additional spectra with comparable wavelength coverage and spectral resolution were ob- 
tained at the KPNO 2m and 4m. A total of 53 objects were observed; 37 of these survive 
the application of the off-axis angle-dependent signal-to-noise thresholds described above 
and therefore represent the most reliable coincidences in our list. We consider only these 
37 X-ray-detected infrared sources below. None of the objects observed was a previously 
known X-ray source; ten had published redshifts (mainly from redshift surveys only), but 
lacked specific classifications. The redshifts of these ten objects implied interesting X-ray 
luminosities (i.e., Lx > a few 10 41 ergs s -1 ), so we observed them to establish correct clas- 
sifications. An eleventh galaxy, 12134+1257 (= Mrk 764) was classified previously as a 
starburst galaxy by Balzano (1983), but its published redshift indicates an X-ray lumi- 
nosity of Lx = 5.2 x 10 42 ergss _1 . Our observation shows it to be a Seyfert 1.9 galaxy, 
more consistent with its high X-ray luminosity. The other 26 objects in our sample were 
previously unidentified, and none of our objects was a target of an Einstein observation. 
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Table 3 lists the X-ray, IRAS, and optical data for the FSC objects we observed. The 
FSC source name (= B1950.0 coordinates), the redshift, and our spectral classification 
of each object comprise the first three columns. The 60 // and 100// IRAS flux densities 
from the FSC (in Jy) and the computed far-infrared luminosity (in ergs s -1 ) are listed 
in the next three columns. Lfir (=47i\Dl 2 I ? fir ergs s -1 ) is computed using the formulae 
.FpiR = 1.26 x 10 -11 (2.58S60 m + Sioo/z) and L>l = cz(l + z/2)/H 0 , where z is the redshift. 
For objects with flux density upper limits at 100//, we estimated Sioo assuming a(60, 100) 
= —1 in order to compute Lfir. As in §4.1.2, we assume Hq = 50 km s -1 Mpc -1 and 
qo = 0. The Galactic H I column toward each FSC source (xlO 20 cm -2 ), the IPC image 
sequence number containing the X-ray source, the significance of the X-ray detection, and 
the IPC source count rate are given in the next four columns. These data were used to 
compute the X-ray flux (in units of 10 -13 ergs cm -2 s -1 ) and l umi nosity (in ergs s -1 ) 
in the 0.16-3.5 keV band assuming a kT = 1 keV thermal spectrum for normal and H II 
galaxies and a power-law spectrum with photon index T = 2 for AGNs. Finally, the IR/X- 
ray and optical/X-ray position offsets (in arcsec) for each IR/X-ray coincidence are fisted 
in the last two columns. 

All of the objects we observed have emission-line optical spectra. Spectral classifica- 
tions were determined using the emission-fine flux ratio diagnostics described by Veilleux 
& Osterbrock (1987) and Filippenko &; Terlevich (1992) along with the emission-line ve- 
locity width information. Table 3 indicates that most of the objects in our sample fall 
into the categories of Seyfert galaxies (Sey), LINERs, and normal late-type galaxies and 
starburst galaxies (H ii). One object observed, a bright K star, is likely to be both the 
X-ray and IR source. (Our “galaxies” subsample of the IRAS FSC is not completely free 
of stellar sources.) One of the remaining objects is a I Zw 1-type Seyfert galaxy, and three 
others are “starburst/Seyfert composite” galaxies, which we describe more fully below. 
Note that several of the objects are interacting galaxies or close galaxy pairs. Any asser- 
tions we make based on the representation of various classes in our sample are somewhat 
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preliminary since we have a relatively small sample that is not complete. However, there 
should not be any strong selection biases in this sample of serendipitously detected X-ray 
sources. 

We expect there to be six chance coincidences among the 37 objects in our sample. 
In principle, the optical/X-ray position offsets Ao/x (penultimate column of Table 3), 
which are accurate to ~ 5", could help distinguish these chance coincidences. However, 
the largest value of A o/x is 87", and only three objects have offsets in excess of 70". 
Since several EMSS sources have optical/X-ray offsets this large, Ao/x for our objects 
may not be particularly useful for determining the chance coincidences in the sample. The 
significance of the X-ray detection a may be a more suitable discriminator. 

There are apparently exceptional X-ray sources in each of the subclasses of IR- and 
X-ray-emitting objects, which we discuss in detail here. 

Seyfert Galaxies. There are 17 outright Seyfert galaxies in our sample. Twelve indicate 
some degree of type 1 activity (i.e., broad permitted emission lines) and only five are 
type 2 Seyferts. In contrast, optically selected Seyfert galaxy samples usually find type 
1 and type 2 objects in the proportion ~ 1:3 (e.g., Osterbrock & Shaw 1988). Our high 
Seyfert 1 fraction is likely due both to the soft X-ray selection used (cf. Stephens 1989) 
and to the fact that we have considered only unclassified objects; this increases the mean 
redshift of our subsample and biases us against less luminous X-ray sources. One Seyfert 2, 
13049+3437 at z = 0.2224, has an extremely high X-ray luminosity (Lx = 6x 10 43 ergs s -1 ). 
Unfortunately, the object is very faint: a better spectrum is required to provide accurate 
emission-line measurements and to determine whether or not the Ha line has a broad 
component. Included amongst the Seyfert l’s is a I Zw 1-type galaxy, 14362-0657. Such 
objects, also called “narrow-line Seyfert l’s,” are known to be luminous X-ray and infrared 
sources (Halpern &: Oke 1987; Moran et a 1. 1995). 
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One of the Seyferts in our sample, 16075+2838 at z = 0.169, has a remarkable spec- 
trum, which we display in Figure 14. The striking aspect of the spectrum is that the 
forbidden [O in] lines are broader than all other lines in the spectrum, forbidden or per- 
mitted. This is a very rare circumstance in Seyfert galaxies. The velocity width of [O III] 
A5007 is 1480 km s -1 (FWHM), whereas the width of H/3 is only 390 km s -1 . There is 
an obvious blue asymmetry to all the emission lines, which makes it difficult to measure 
accurately the widths of the blended fines such as Ha , [N n], and [S il]. With this single 
spectrum alone we axe not able to solve the puzzle of the emission fine widths. CCD imag- 
ing and higher S/N spectra at greater resolution are needed to decipher the true nature 
of this Seyfert galaxy. 

Fortuitously, 16075+2838 has been observed as part of the FIRST VLA survey of 
the radio sky (described in Becker, White, & Helfand 1995). Its 20 cm flux density is 3.8 
mJy which, at z = 0.169, means its 20 cm radio power is 5.5 xlO 23 W Hz -1 . This radio 
luminosity is among the highest known for Seyfert galaxies (Ulvestad & Wilson 1989). A 
possible explanation for the width of the [O III] fines is that the radio-emitting plasma is 
interacting with the ambient interstellar medium in the host galaxy, giving rise to spatially 
extended [O III] emission with a multiplicity of velocity components. Such effects have been 
observed in some radio-bright Markarian galaxies (Whittle et a 1. 1988), the unusual Seyfert 
galaxy Was 49 (Moran et a i. 1992), and galaxies harboring compact steep-spectrum (CSS) 
radio sources (Gelderman & Whittle 1994). Our spectrum contains most of the fight from 
the galaxy; thus, if such extended [O III] emission is present, the integrated fight spectrum 
mi ght plausibly indicate an unusually broad [O in] velocity width. High-resolution radio 
images at several frequencies would provide important information about this object. 

LINERs. One of the two LINERs in our sample, 16022+1753S, has an X-ray lumi- 
nosity exceptional for its class (Lx= 8 x 10 42 ergs s -1 ; LINER luminosities are typically 
10 42 ergs s' 1 or less). However, it is likely that this object has a weak;, broad Ha line, and 
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its companion (which has a very reddened H n or Sey 2 spectrum) may also contribute to 
the X-ray flux. 

Normal Star-Forming Galaxies. Five of the 13 H II galaxies we observed have L\ in 
the range 1-4 x 10 42 ergs s -1 , which makes them more luminous than all other normal H II 
galaxies detected at X-ray wavelengths, but by less than a factor of 10 (Fabbiano 1989). 
These may indeed be very luminous H II galaxies, which, assuming they axe rare, we 
are finding because we have examined a large sample of faint X-ray sources. But evidence 
continues to mount against the existence of any starburst galaxies with Lx substantially in 
excess of 10 42 ergs s -1 (Moran et a 1. 1995; Halpern, Helfand, & Moran 1995), and we must 
approach this issue cautiously. All five high- Lx H II galaxies are low-significance detections 
and there is likely to be considerable uncertainty in their X-ray fluxes. Furthermore, since 
cr < 3.1 for all five objects, they could all be the expected chance coincidences in our list. 
The X-ray source in one case, 16008+3339, may actually be a nearby M star. On the 
other hand, we know that many of our low-a X-ray sources are real. In particular, our 
z = 3.87 quasar 1745+6228, discussed in §4.1.2, is listed as a 2.59a source in our catalog. 
ROSAT observations have confirmed the existence of this X-ray source (Fink & Briel 
1993). Similarly, the Seyfert 1.9 galaxy 14060+7207, a 3.2a detection in this sample, was 
detected in the ROSAT All-Sky Survey (Boiler et a 1. 1992). The X-ray identifications of 
these possible high-Lx H II galaxies should also be confirmed with other X-ray observations. 

“Starburst /Seyfert Composite” Galaxies. With the exception of 16075+2838, the 
most surprising result from the optical identification of these IPC/FSC sources has been 
the discovery of three objects whose optical spectra indicate both vigorous star formation 
and Seyfert activity. We tentatively call these objects starburst/Seyfert composite galax- 
ies, or simply “composites.” The emission-line flux ratios for these galaxies place them 
either between the areas occupied by Seyferts and H n galaxies on the standard diagnostic 
diagrams (e.g., Veilleux & Osterbrock 1987), or, in one case, solidly in the H ii galaxy area. 
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Furthermore, most of the emission lines are very narrow (velocity widths < 300 km s _1 ), 
also typical for H II galaxies. However, all three of the composites have [O in] line widths 
substantially greater (by 100-280 km s -1 ) than those of all other lines in the spectrum. In 
addition, two of the galaxies display faint, broad wings on the Ha line. These features can 
only be interpreted as evidence for an active nucleus. The computed X-ray l umin osities for 
all three composites exceed 10 42 ergs s -1 , further indication that they harbor active nuclei. 
Examples of their spectra are displayed in Figure 15 and their emission-line properties are 
listed in Table 4. 

Our interpretation of these objects is that the H II and Seyfert components contribute, 
in varying amounts, to the optical emission lines in these galaxies, but that the Seyfert 
component is primarily responsible for the X-ray emission. Four similar galaxies were 
discussed by Veron, Veron, & Zuiderwijk (1981) and Veron et a 1. (1981), who came to the 
same conclusions regarding their composite nature. It is not clear at present whether there 
is any relation between the composite objects and 16075+2838, which has extremely broad 
[O in] lines. Due to the optical weakness of the Seyfert component in the composites, it is 
likely that different mechanisms are responsible. 

The mix of objects in our IPC/FSC sample provides an accurate preview of the un- 
classified sources we have observed (Moran et a 1. 1995) from the ROSAT / PSC sample of 
Boiler et a 1. (1992), which includes several I Zw 1-type objects (20) and starburst/Seyfert 
composites (7) in addition to classical Seyferts and star-forming galaxies. The similarity 
of the results for these two different samples confirms that our selection and study of faint 
Einstein sources has been successful and useful. 

The starburst/Seyfert composite class of galaxies represents a promising potential 
contributor to the XRB. They are appealing in this role because they axe luminous X-ray 
sources but are optically discreet about their nucleax activity. They may have gone laxgely 
unn oticed in the past because very high quality optical spectra axe required to reveal 
their subtle Seyfert characteristics. Their Seyfert components may appear weak optically 
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because they are internally absorbed or, perhaps, totally obscured from our direct view. 
If so, their intrinsic X-ray luminosities axe much higher than we observe them to be. 
Furthermore, their hard X-rays may be less attenuated than their soft X-rays, resulting 
in effectively hard emitted X-ray spectra. In this case, the composite galaxies could make 
an interesting contribution to the hard XRB. Detailed speculation about the nature of the 
composite objects is provided in Moran et a 1 . (1995). 

5. Summary 

We have presented new analysis of the Einstein Observatory IPC data base and have 
assembled a catalog of sources and fluctuations in 2520 high latitude images exceeding 2 a 
significance. Comparisons of (a) our catalog to the Einstein EMSS and of (b) the locations 
of sources on the IPC to the IPC flat field have validated our procedures for extracting 
sources at both high and low levels of significance. We have modelled the number of real 
sources in the Two-Sigma Catalog based on assumptions about the number counts/flux 
relation for sources in our flux range, the proportionality between source count rates and 
their energy fluxes, and the known vignetting and background characteristics of the IPC. 
The results of the model indicate that 28%, or ~ 13,000, of the entries in the catalog are 
real celestial X-ray sources. This is an increase of ~ 9,100 over the number of sources found 
in previous analyses of the same 2520 IPC images. Furthermore, we have established that 
the fraction of spurious sources in the catalog systematically increases at increasing off-axis 
angles on the detector. The reliability of subsamples of sources drawn from the catalog 
can thus be tuned by applying different limiting signal-to-noise thresholds to sources on 
the basis of their off-axis angles. 

We have applied filters to the Two-Sigma Catalog in the form of astronomical catalogs 
at other wavelengths (i.e., from radio and infrared all-sky surveys) to select samples of 
special classes of X-ray sources which are themselves ~ 90% reliable. Our primary objective 
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for doing so has been to search these samples for possible new components of the cosmic 
X-ray background. Followup spectroscopy of the optical counterparts of these sources 
has uncovered several fascinating X-ray sources in each sample, illustrating the merits of 
selecting X-ray sources using a variety of methods. In the radio-selected sample, we have 
detected high redshift (z m 4) quasars, which are not present in the EMSS, as well as a 
large number of very luminous X-ray sources that optically appear to be normal early-type 
galaxies. ASCA and ROSAT observations have been carried out to examine each of these 
types of objects more closely. In the infrared sample, we have discovered a Seyfert galaxy, 
16075+2838, whose unusual spectrum boasts [O ill] forbidden emission lines more than 
1000 km s -1 broader than the permitted lines. A few star-for ming galaxies with extreme 
X-ray luminosities (> 10 42 ergss -1 ) appear in the IR sample, but their identifications as 
X-ray sources need confirmation. We have also discovered three luminous X-ray galaxies 
with composite starburst/Seyfert spectra. Due to the disparity between the X-ray and 
optical properties of these galaxies, these composites might represent an important new 
component of the X-ray background. 

The Einstein IPC Two-Sigma Catalog is available on-line through the Einline service 
of the Harvard-Smithsonian Center for Astrophysics and through HEASARC at NASA 
Goddard Space Flight Center. We are very grateful to Yohan Fernando for his assistance 
with the reduction of the optical spectra. ECM and D JH acknowledge support for this work 
under NASA Astrophysics Data Program contract NAS5-32063 and from NASA grant 
NAGW-2507. RHB acknowledges partial support under the auspices of the Department 
of Energy by Lawrence Livermore National Laboratory under contract No. W-7405-ENG- 
48. This is Columbia Astrophysics Laboratory contribution number 571. 
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TABLE 2 


X-ray-Detected High-2 Quasars 


QSO 

2 

Radio Loud? 

<?x a 

Rank 

Reference 

1126 + 2342 

3.04 

Yes 

15.6 

9 

1 

1508 + 5714 

4.30 

Yes 

54.0 

4 

1 

1745 + 624 

3.87 

Yes 

23.1 

7 

1,2,3 

1759 + 6638 

4.32 

No 

1.1 


. 4 

0000 - 263 

4.11 

No 

4.3 


5 

0014 + 813 

3.38 

Yes 

67.0 

2 

5 

0130 - 403 

3.03 

No 

8.1 


5 

0420 - 388 

3.12 

No 

30.9 

6 

5 

0636 + 680 

3.17 

Yes 

12.9 

10 

5 

1208 + 101 

3.82 

Yes 

5.0 


5 

1946 + 768 

3.02 

No 

6.8 


5 

2000 - 330 

3.78 

Yes 

16.9 

8 

5 

2126 - 158 

3.27 

Yes 

170.3 

1 

5 

0041 - 2638 

3.05 

No 

4.4 


6 

0042 - 2627 

3.29 

No 

5.7 


6 

0256 - 0000 

3.36 

Yes 

7.4 


6 

2231 - 0015 

3.02 

No 

8.3 


6 

0537 - 286 

3.11 

Yes 

60.7 

3 

7 

0642 + 449 

3.40 

Yes 

43.3 

5 

7 

1402 + 044 

3.20 

Yes 

6.6 


7 

1614 + 051 

3.21 

Yes 

9.6 


7 

2204 - 408 

3.18 

Yes 

12.1 


7 


Notes: 

a Measured or estimated ROSAT PSPC count rate, xlO -3 count s -1 . 


References: (1) This work, (2) Becker et al. 1992, (3) Fink & Briel 1993, 
(4) Henry et al. 1994, (5) Bechtold et al. 1994a, (6) Pickering et al. 1994, 
(7) Worrall et al. 1987 



Infrared-Selected X-ray Sources 
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Figure Captions 

Fig. 1. — Distribution of exposure times for the 2520 IPC images used to construct the 
Two-Sigma Catalog. 

Fig. 2. — Mean number of sources detected per IPC image as a function of exposure time. 
The upper set of points (diamonds) is for all sources in the Two-Sigma Catalog. The 
lower set of points (crosses) is for the subset of sources with a > 4. The larger crosses at 
exposures above 10 4 s indicate the approximate range in each exposure time interval. 

Fig. 3. — Significances of our detections of EMSS sources compared to those published in 
the EMSS itself. One ~ 40<r source, which falls very close to the identity line, is not shown. 

Fig. 4. — (top) Location on the IPC of sources in the Two-Sigma Catalog; (bottom) the IPC 
flat field. Comparison of the two panels indicates that more/fewer sources were detected 
in places where the IPC is more/less sensitive. The white bars correspond to the IPC 
window support ribs. 

Fig. 5. — Number of lower-significance sources in the Two-Sigma Catalog as a function of 
position offset from higher significance sources in the range indicated by o\. The number in 
each offset bin has been normalized by the area (in arcmin 2 ) of the annulus that represents 
the offset range covered by the bin. The peak at small offsets in each panel is due to spurious 
“satellite sources” which result when not enough data are deleted after the detection of 
an extended or extreme-spectrum source. The la error bars indicate that the effect is not 
statistical. 

Fig. 6. — The median signal-to-noise ratio a m of sources in the Two-Sigma Catalog as a 
function of off-axis angle 9. On-axis, a m is determined mainly by targets. a m decreases 
steadily off-axis. 
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Fig. 7. — Histogram of total source-subtracted background count rates for IPC images used 
to construct the Two-Sigma Catalog. 

Fig. 8. — Distribution of source signal-to-noise ratios a for the Two-Sigma Catalog in 
different off-axis angle ranges (solid lines) and the model calculations given by equation (6) 
(dotted lines). The model, which is in excellent agreement with the data above 4<r, assumes 
the soft X-ray log iV-log S relation has a slope of 1.5 in the flux range spanned by the 
catalog. The departure of the data from the model below 4a reflects the increasing number 
of spurious sources in the catalog at lower significances. Note the departure becomes more 
severe at large off-axis angles. The results of the model suggest that ~ 28%, or ~ 13,000, 
of the sources in the Two-Sigma Catalog are real celestial X-ray sources. 

Fig. 9. — Results of the cross-correlation of the Two-Sigma Catalog with the Greenbank 6 
cm radio catalog (a) before and (b) after application of the off-axis angle-dependent signal- 
to-noise threshold described in the text. The dashed fine indicates the chance coincidence 
level in each case. The solid horizontal line is the mean chance coincidence level. The 
number in each offset bin has been normalized by the area (in arcsec 2 ) of the annulus that 
represents the offset range covered by the bin. 

Fig. 10. — Results of the cross-correlation of the Two-Sigma Catalog with the Texas 80 cm 
radio catalog (a) before and (b) after application of the off-axis angle-dependent signal-to- 
noise threshold. 

Fig. 11. — Discovery spectrum of Q1508+5714, the z = 4.30 radio- and X-ray-bright 
quasar. The strong line is Lya. 

Fig. 12 — Optical spectrum, 20 cm VLA A-array radio image, and 900 s R band optical 
image of the optically passive X-ray galaxy 2324+4222. 
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Fig. 13. — Results of the cross-correlation of the Two-Sigma Catalog with the IRAS Faint 
Source Catalog (a) before and (b) after application of the off-axis angle-dependent signal- 
to-noise threshold. 

Fig. 14. — The unusual spectrum of IRAS F16075+2838, and detail of the [O ill], H (3 
region. 

Fig. 15. — The spectrum of IRAS F08255+2630 (top), a starburst/Seyfert composite ob- 
ject, has primarily starburst galaxy characteristics. However, a close look at the [O hi], 
H/3 region (middle) reveals broad [O in] lines (FWHM = 377 km s -1 ), which must arise 
in an AGN. The spectrum of another composite object, F01445+3503 (bottom) indicates 
a weak, broad Ha feature as well, not present in F08255+2630. 
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